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Abstract 

Regenerating codes are a class of distributed storage codes that optimally trade the bandwidth needed for repair 

^^ of a failed node with the amount of data stored per node of the network. Minimum Storage Regenerating (MSR) 

r— V codes minimize first, the amount of data stored per node, and then the repair bandwidth, while Minimum Bandwidth 

psj Regenerating (MBR) codes carry out the minimization in the reverse order An [n, fc, d\ regenerating code permits 

^ the data to be recovered by connecting to any k of the n nodes in the network, while requiring that repair of a 

C^ failed node be made possible by connecting (using links of lesser capacity) to any d nodes. Previous, explicit and 

*~~5 general constructions of exact-regenerating codes have been confined to the case n = d+1. 

f^ In this paper, we present optimal, explicit constructions of MBR codes for all feasible values of [n, k, rf] and 

C^ MSR codes for all [n, k, d > 2k — 2], using a product-matrix framework. The particular product-matrix nature of 

^^ the constructions is shown to significantly simplify system operation. To the best of our knowledge, these are the 

Li^ first constructions of exact-regenerating codes that allow the number n of nodes in the distributed storage network, 

HH to be chosen independent of the other parameters. 

C/3 The paper also contains a simpler description, in the product-matrix framework, of a previously constructed 

I ^1 MSR code in which the parameter d satisfies [n = d+ 1, k, d > 2k — \\. 
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I. Introduction 



In a distributed storage system, information pertaining to a data file (the message) is dispersed across nodes in 
a network in such a manner that an end-user can retrieve the data stored by tapping into a subset of the nodes. A 
lO popular option that reduces network congestion and that leads to increased resiliency in the face of node failures, is 
O to employ erasure coding, for example by calling upon maximum-distance-separable (MDS) codes such as Reed- 
Solomon (RS) codes. Let B be the total file size measured in terms of symbols over a finite field ¥q of size q. With 
RS codes, data is stored across n nodes in the network in such a way that the entire message can be recovered 
by a data collector by connecting to any k nodes, a process that we will refer to as data-reconstruction. Several 
rN distributed storage systems such as RAID-6 yj, OceanStore [2| and Total Recall |[3j| employ such an erasure-coding 
Oj option. 

A. Regenerating Codes 

Upon failure of an individual node, a self-sustaining data-storage network must necessarily possess the ability 
to regenerate (i.e., repair) a failed node. An obvious means to accomplish this is to permit the replacement node 
to connect to any k nodes, download the entire message, and extract the data that was stored in the failed node. 
But downloading the entire B units of data in order to recover the data stored in a single node that stores only a 
fraction of the entire message is wasteful, and raises the question as to whether there is a better option. Such an 
option is indeed available and provided by the concept of a regenerating code introduced in the pioneering paper 
by Dimakis et al. |j4j. 
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Conventional RS codes treat each fragment stored in a node as a single symbol belonging to the finite field 
Fg. It can be shown that when individual nodes are restricted to perform only linear operations over Fg, the total 
amount of data download needed to repair a failed node, can be no smaller than B, the size of the entire file. In 
contrast, regenerating codes are codes over a vector alphabet and hence treat each fragment as being comprised of 
a symbols over the finite field Fg. Linear operations over Fg in this case, permit the transfer of a fraction of the 
data stored at a particular node. Apart from this new parameter a, two other parameters d and f3 are associated 
with regenerating codes. Under the definition of regenerating codes introduced in ||4j|, a failed node is permitted to 
connect to an arbitrary set of d of the remaining nodes while downloading /3 < a symbols from each node. This 
process is termed as regeneration and the total amount dji of data downloaded for repair purposes as the repair 
bandwidth. Further, the set of d nodes aiding in the repair are termed as helper nodes. Typically, with a regenerating 
code, the average repair bandwidth is small compared to the size of the file B. 

It will be assumed throughout the paper, that whenever mention is made of an [n, k, d] regenerating code, the 
code is such that k and d are the minimum values under which data-reconstruction and regeneration can always be 
guaranteed. This restricts the range of d to 

k<d<n-l . (1) 

The first inequality arises because if the regeneration parameter d were less than the data-reconstruction parameter 
k then one could, in fact, reconstruct data by connecting to any d < k nodes (treating the data stored in every 
other node as a function of that stored in these d nodes) thereby contradicting the minimality of k. Finally, while 
a regenerating code over Fg is associated with the collection of parameters 

{n, fc, d, a, /3, B} , 

it will be found more convenient to regard parameters {n, k, d} as primary and {a, /3, B} as secondary and thus 
we will make frequent references in the sequel, to a code with these six parameters as an [n, k, d] regenerating 
code having parameter set (a, /3, B). 

B. Cut-Set Bound and the Storage vs Repair-Bandwidth Tradeoff 

A major result in the field of regenerating codes is the proof in |[5| that uses the cut-set bound of network coding 
to establish that the parameters of a regenerating code must necessarily satisfy : 

fc-i 
B < Y^ min{a, {d - i)/3} . (2) 

It is desirable to minimize both a as well as /3 since, minimizing a results in a minimum storage solution, 
while minimizing /3 (for fixed d) results in a storage solution that minimizes repair bandwidth. As can be deduced 
from Q, it is not possible to minimize both a and /3 simultaneously and thus there is a tradeoff between choices 
of the parameters a and /3. The two extreme points in this tradeoff are termed the minimum storage regeneration 
(MSR) and minimum bandwidth regeneration (MBR) points respectively. The parameters a and /3 for the MSR 
point on the tradeoff can be obtained by first minimizing a and then minimizing /3 to obtain 

B 

" = ¥' 

^ " k{d-k + i) ■ ^^^ 

Reversing the order, leads to the MBR point which thus corresponds to 



k{2d-k + l) 



2dB 

a = . (4) 

k{2d-k + l) ^' 

We define an optimal [n, k, d] regenerating code as a code with parameters (a, /3, B) satisfying the twin 



requirements that 

1) the parameter set (a, /3, B) achieves the cut-set bound with equaUty and 

2) decreasing either a or /3 will result in a new parameter set that violates the cut set bound. 

An MSR code is then defined as an [n, k, d\ regenerating code whose parameters (a, f3, B) satisfy Q and 
similarly, an MBR code as one with parameters (a, /?, B) satisfying (|4]). Clearly, both MSR and MBR codes are 
optimal regenerating codes. 

C. Striping of Data 

The nature of the cut-set bound permits a divide-and-conquer approach to be used in the application of optimal 
regenerating codes to large file sizes, thereby simplifying system implementation. This is explained below. 

Given an optimal [n, A;, d] regenerating code with parameter set {a,(3,B), a second optimal regenerating code 
with parameter set (a = 6a, /? = 6(3, B = 6B) for any positive integer 6 can be constructed, by dividing the 6B 
message symbols into 6 groups of B symbols each, and applying the (a, /3, B) code to each group independently. 
Secondly, a common feature of both MSR and MBR regenerating codes is that in either case, their parameter set 
(a, P, B) is such that both a and B are multiples of (3 and further that ^, 4 are functions only of n, k and d. It 
follows that if one can construct an (optimal) [n, k, d] MSR or MBR code with /3 = 1, then one can construct 
an (optimal) [n, k, d] MSR or MBR code for any larger value of /3. In addition, from a practical standpoint, 
a code constructed through concatenation of codes for a smaller /3 will in general, be of lesser complexity (see 



Section VI-Cl. For these reasons, in the present paper we design codes for the case of /3 = 1. Thus, throughout 
the remainder of the paper, we will assume that /? = 1. In the terminology of distributed storage, such a process 
is called striping. 

We document below the values of a and B of MSR and MBR codes respectively, when (3 = 1: 

a = d-k + 1 , (5) 

B = k{d-k + l) (6) 

for MSR codes and 

a = d , (7) 

B = kd-(^^ (8) 

in the case of MBR codes. 

D. Additional Terminology 

a) Exact versus Functional Regeneration: In the context of a regenerating code, by functional regeneration 
of a failed node /, we will mean, replacement of the failed node by a new node /' in such a way that following 
replacement, the resulting network of n nodes continues to possess the data-reconstruction and regeneration prop- 
erties. In contrast, by exact-regeneration, we mean replacement of a failed node / by a replacement node /' that 
stores exactly the same data as was stored in node /. We will use the term exact-regenerating code to denote a 
regenerating code that has the capability of exactly regenerating each instance of a failed node. Clearly where it is 
possible, an exact-regeneration code is to be preferred over a functional-regeneration code since, under functional 
regeneration, there is need for the network to inform all nodes in the network of the replacement, whereas this is 
clearly unnecessary under exact-regeneration. 

b) Systematic Regenerating Codes: A systematic regenerating code can be defined as a regenerating code 
designed in such a way that the B message symbols are explicitly present amongst the ka code symbols stored 
in a select set of k nodes, termed as the systematic nodes. Clearly, in the case of systematic regenerating codes, 
exact-regeneration of (the systematic portion of the data stored in) the systematic nodes is mandated. 



c) Linear Regenerating Codes: A linear regenerating code is defined as a regenerating code in which 

1) the code symbols stored in each node are linear combinations over Fg of the B message symbols {ui}f^^, 

2) the symbols passed by a helper node h to aid in the regeneration of a failed node / are linear over ¥q in the 
a symbols stored in node h. 

It follows as an easy consequence, that linear operations suffice for a data collector to recover the data from 
the ka code symbols stored in the k nodes that it has connected to. Similarly, the replacement node for a failed 
node /, performs linear operations on the d symbols passed on to it by the d helper nodes {hi}'^^^ aiding in the 
regeneration. 

E. Results of the Present Paper 

While prior work is described in greater detail in Section |ll| we begin by providing a context for the results 
presented here. 

Background: To-date, explicit and general constructions for exact-regenerating codes at the MSR point have 
been found only for the case [n = d + 1, k, d > 2k — 1]. Similarly at the MBR point, the only explicit code to 
previously have been constructed is for the case d = n — 1. Thus, all existing code constructions limit the total 
number of nodes n in the system to ci+ 1. This is restrictive since in this case, the system can handle only a single 
node failure at a time. Also, such a system does not permit additional storage nodes to be brought into the system. 

A second open problem in this area that has recently drawn attention is as to whether or not the storage -repair 
bandwidth tradeoff is achievable under the additional requirement of exact-regeneration. It has previously been 
shown that the MSR point is not achievable for any [n, k, d < 2k — 3] with /3 = 1, but is achievable for all 
parameters [n, k, d] when B (and hence f3 as well) is allowed to approach infinity. 

Results Presented in Present Paper: In this paper, (optimal) explicit constructions of exact-regenerating MBR 
codes for all feasible values of [n, k, d] and exact-regenerating MSR codes for all [n, k, d >2k — 2] are presented. 
The constructions are of a product-matrix nature that is shown to significantly simplify operation of the distributed 
storage network. The constructions presented prove that the MBR point for exact-regeneration can be achieved for 
all values of the parameters and that the MSR point can be achieved for all parameters satisfying d > 2k — 2. In 
both constructions, the message size is as dictated by cut-set bound. The paper also contains a simpler description, 
in the product-matrix framework, of an MSR code for the parameters [n = d+1, k, d>2k—l] that was previously 
constructed in Q, (TJ. 

A brief overview of prior work in this field is provided in the next section. Section |ll] The product-matrix 
framework underlying the code construction is described in Secti on [Till An exact-regenerating MBR code for all 
feasible values of the parameters [n, k, d] is presented in Section |IV[ and an exact-regenerating MSR code for all 
[n, k, d > 2k — 2] is presented in Section M Implementation advantages of the particular product-matrix nature of 
the code constructions provided here are described in Section VI The final section. Section [VTH draws conclusions. 



Appendix [A] contains a simpler description, in the product-matrix framework, of an MSR code with parameter d 
satisfying [n = d+ I, k, d> 2k — 1], that was previously constructed in ||6|, p\. 

II. Prior Work 

The concept of regenerating codes was introduced in f4], where it was shown that permitting the storage nodes to 
store more than B/k units of data helps in reducing the repair bandwidth. Several distributed systems were analyzed, 
and estimates of the mean node availability in such systems obtained. Using these values, it was shown through 
simulation, that regenerating codes can reduce repair bandwidth compared to other designs, while simplifying 
system architecture. 

The problem of minimizing repair bandwidth for functional repair of a failed storage node is considered in PI, 
Q. Here, the evolution of the storage network through a sequence of failures and regenerations is represented as a 
network, with all possible data collectors represented as sinks. The data-reconstruction requirement is formulated as 
a multicast network coding problem, with the network having an infinite number of nodes. The cut-set analysis of 



this network leads to the relation between the parameters of a regenerating code given in equation Q. It can be seen 
that there is a tradeoff between the choice of the parameters a and /3 for a fixed B and this is termed as the storage- 
repair bandwidth tradeoff. It has been shown (||5J, |[8J) that this tradeoff is achievable under functional-regeneration. 
However, the coding schemes suggested are not explicit and require large field size. The journal version \^ also 
contains a handcrafted functional-regenerating code for the MSR point with [n = 4, k = 2, d = 3]. 



A study of the computational complexity of regenerating codes is carried out in 1 10|, in the context of random 
linear regenerating codes that achieve functional repair. 

The problem of exact-regeneration was first considered independently in pT] |, ^T\ and [13|. In [TT|, it is shown 
that the MSR point is achievable under exact-regeneration when (A; = 2, d = n — 1). The coding scheme proposed 
is based on the concept of interference alignment developed in the context of wireless communication. However, 
the construction is not explicit and has a large field size requirement. In flS], the authors carry out a computer 
search to find exact-regenerating codes at the MSR point, resulting in identification of codes with parameters 
[n = 5, k = 3, d = 4]. 

The first, explicit construction of regenerating codes for a general set of parameters was provided for the MBR 
point in [12] with d = n — I and arbitrary k. These codes have low regeneration complexity as no computation is 



involved during the exact-regeneration of a failed node. The field size required is of the order of n^. In addition 1 12 1 
(see also fT4l) also contains the construction of an explicit MSR code ior d = k + 1, that performs approximately- 
exact-regeneration of all failed nodes, i.e., regeneration where a part of the code is exactly regenerated, and the 
remaining is functionally regenerated (it is shown subsequently in |[6|, | |T4| that exact-regeneration is not possible, 
when /c > 4, for the set of parameters considered therein). 



MSR codes performing a hybrid of exact and functional-regeneration are provided in |15|, for the parameters 
d = k + 1 and n > 2k. The codes given even here are non-explicit, and have high complexity and large field-size 
requirement. 

A code structure that guarantees exact-regeneration of just the systematic nodes is provided in |6|, for the MSR 
point with parameters [n = d + 1, k, d > 2k — 1]. This code makes use of interference alignment, and is termed 
as the 'MISER' code in journal-submission version |14] of [6|. Subsequently, it was shown in Q that for this 
set of parameters, the code introduced in |6| for exact-regeneration of only the systematic nodes can also be used 
to repair the non-systematic (parity) node failures exactly provided repair construction schemes are appropriately 
designed. Such an explicit repair scheme is indeed designed and presented in ||7|. The paper |[7| also contains an 
exact-regenerating MSR code for parameter set [n = 5, k = 3, d = 4]. 

A proof of non-achievability of the cut-set bound on exact-regeneration at the MSR point, for the parameters 



[n, k, d < 2k — 3] when /3 = 1, is provided in 1 14 1. On the other hand, the MSR point is shown to be achievable 



in the hmiting case of B approaching infinity (i.e., /3 approaching infinity) in | ,16J , | ,17| . 



A flexible setup for regenerating codes is described in |18|, where a data collector (or a replacement node) can 
perform data-reconstruction (or regeneration) irrespective of the number of nodes to which it connects, provided 
the total data downloaded exceeds a certain threshold. 



In 1 19 1, the authors establish that essentially all points on the interior of the tradeoff (i.e., points other than MSR 



and MBR) are not achievable under exact-regeneration. 

HI. The Common Product-Matrix Framework 

The constructions described in this paper, follow a common product-matrix framework. Under this framework, 
each codeword in the distributed storage code can be represented by an (n x a) code matrix C whose i*^ row c^ 
contains the a symbols stored by the i^^ node. Each code matrix is the product 

C = ^M (9) 

of an (n x d) encoding matrix ^ and an {d x a) message matrix M. The entries of the matrix ^ are fixed a priori 
and are independent of the message symbols. The message matrix M contains the B message symbols, with some 
symbols possibly repeated. We will refer to the i*^ row ip^. of ^ as the encoding vector of node i as it is this vector 



that is used to encode the message into the form in which it is stored within the i^^ node: 

c* = i/M , (10) 

where the superscript 't' is used to denote the transpose of a matrix. Throughout this paper, we consider all symbols 
to belong to a finite field Wg of size q. 

This common structure of the code matrices leads to common architectures for both data-reconstruction and 
exact-regeneration, as explained in greater detail below. It also endows the codes with implementation advantages 
that are discussed in Section |Vll 

Data-reconstruction amounts to recovering the message matrix M from the ka symbols obtained from an arbitrary 

set of k storage nodes. Let us denote the set of k nodes to which the data collector connects as {ii, . . . ,ik}- The 

j^'^ node in this set passes on the message vector i/j^. M to the data collector. The data collector thus obtains the 

— ^j 
product matrix 

where ^dc is the submatrix of ^ consisting of the k rows {ii, . . . , z^}. It then uses the properties of the matrices ^ 
and M to recover the message. The precise procedure for recovering M is a function of the particular construction. 

As noted above, each node in the network is associated to a distinct (dx 1) encoding vector t/j ..In the regeneration 
process, we will need to call upon a related vector fi. of length a, that contains a subset of the components oi tp.. 
To regenerate a failed node /, the node replacing the failed node connects to an arbitrary subset {hi, ... , h^} of d 
storage nodes which we will refer to as the d helper nodes. Each helper node passes on the inner product of the 
a symbols stored in it with /x , to the replacement node: the helper node /i, passes 



-/ 



V^l/^^/ • 



The replacement node thus obtains the product matrix 

^repairM//^ , 

where ^repah- is the submatrix of ^ consisting of the d rows {hi, ... , h^]. From this it turns out, as will be shown 
subsequently, that one can recover the desired symbols. Here again, the precise procedure is dependent on the 
particular construction. 

Remark 1: An important feature of the product-matrix construction presented here, is that each of the nodes hj 
participating in the regeneration of node /, needs only have knowledge of the encoding vector of the failed node / 
and not the identity of the other nodes participating in the regeneration. This significantly simplifies the operation 
of the system. 

Systematic Codes: The following theorem shows that any linear exact-regenerating code can be converted to a 
systematic form via a linear remapping of the symbols. The proof of the theorem may be found in Appendix [Bj 

Theorem 1: Any linear exact-regenerating code can be converted to a systematic form via a linear remapping 
of the message symbols. Furthermore, the resulting code is also linear and possesses the data-reconstruction and 
exact-regeneration properties of the original code. 

Thus, all codes provided in the present paper can be converted to a systematic form via a linear remapping of 
the message symbols. Specific details on the product-matrix MBR and MSR codes in systematic form are provided 
in the respective sections. Sections IV and [V] 



IV. The Product-Matrix MBR Code Construction 

In this section, we identify the specific make-up of the encoding matrix ^ and the message matrix M that results 
in an [n, k,d] MBR code with /3 = 1. A notable feature of the construction is that it is applicable to all feasible 
values of [n, k, d], i.e., all n, k, d satisfying k < d < n — 1. Since the code is required to be an MBR code with 
/3 = 1, it must possess the data-reconstruction and exact-regeneration properties required of a regenerating code, 
and in addition, have parameters {a, B} that satisfy equations (|7]) and ([8]l. Equation ^ can be rewritten in the 



form: 



B 



k + l 
2 



+ k{d-k) 



Thus the parameter set of the desired [n, k, d] MBR code is (a = d, /? = 1, i? = ( ^ ) + k{d — k)). 

Let 5 be a (fc X fc) matrix constructed so that the ( ^ ) entries in the upper- triangular half of the matrix are filled 
up by ( ^ ) distinct message symbols drawn from the set {ui}f^i. The (g) entries in the strictly lower-triangular 
portion of the matrix are then chosen so as to make the matrix S a symmetric matrix. The remaining k{d — k) 
message symbols are used to fill up a second {k x {d — k)) matrix T. The message matrix M is then defined as 
the {d X d.) symmetric matrix given by 

S T 
T* 



M 



(11) 



The symmetry of the matrix will be found to be instrumental when enabling node repair. Next, define the encoding 
matrix \I' to be any (n x d) matrix of the form 

*= [ $ A ] , 

where <1> and A are (n x k) and {n x {d — k)) matrices respectively, chosen in such a way that 

1) any d rows of ^ are linearly independent, 

2) any k rows of <l> are linearly independent. 

The above requirements can be met, for example, by choosing ^ to be either a Cauchy |20| or else a Vandermonde 
matrix|^ As per the product-matrix framework, the code matrix is then given by C = ^M. 

The two theorems below establish that the code presented is an [n, k, d] MBR code by establishing respectively, 
the exact-regeneration and data-reconstruction properties of the code. 

Theorem 2 (MBR Exact-Regeneration): In the code presented, exact-regeneration of any failed node can be 
achieved by connecting to any d of the (n — 1) remaining nodes. 

Proof: Let ip^ be the row of ^ corresponding to the failed node /. Thus the d symbols stored in the failed 
node correspond to the vector 



^^M. 



(12) 



The replacement for the failed node / connects to an arbitrary set {hj \ j = I, . . . ,d} oi d helper nodes. Upon 
being contacted by the replacement node, the helper node hj computes the inner product 

and passes on this value to the replacement node. Thus, in the present construction, the vector /i equals i/j itself. 
The replacement node thus obtains the d symbols ^repair AfV' , from the d helper nodes, where 



^ 



repair 



4^ 



i^i 



-hd 

By construction, the {d x d) matrix ^repair is invertible. Thus, the replacement node recovers Mip through 
multiplication on the left by ^^epair- Since M is symmetric, 

and this is precisely the data previously stored in the failed node. 

'Over a large finite field, every matrix ^ in general position will also satisfy these requirements. 



t]M , 



(13) 



Theorem 3 (MBR Data-Reconstruction): In the code presented, all the B message symbols can be recovered by 
connecting to any k nodes, i.e., the message symbols can be recovered through linear operations on the entries of 
any k rows of the matrix C. 

Proof: Let 

^DC = [ ^DC Adc ] (14) 

be the {k x d) submatrix of ^, corresponding to the k rows of ^ to which the data collector connects. Thus the 
data collector has access to the symbols 

^DCM = [ $Dc5 + AdcT* ^dcT ] . (15) 

By construction, ^^c is a non-singular matrix. Hence, by multiplying the matrix ^dc-^^ on the left by ^^, one 
can recover first T and subsequently, S. ■ 

A. An Example for the Product-Matrix MBR Code 

Let n = 6, A: = 3, d = A. Then a = d = 4 and B = 9. Let us choose g = 7 so we are operating over Fy. The 
matrices S and T are filled up by the 9 message symbols {ui}^^^ as shown below: 



S 



so that the message matrix M is given by 
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(16) 
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We choose ^ to be the (6 x 4) Vandermonde matrix over F7 given by 
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(18) 



Fig. [T] shows at the top, the (6 x 4) code matrix C = ^M with entries expressed as functions of the message 
symbols {uj}?^^. The rest of the figure explains how exact-regeneration of failed node 1 takes place. To regenerate 
node 1, the helper nodes (nodes 2,4,5,6 in the example), pass on their respective inner products if^^Mll 1 1 1]* 



for £ = 
where 



zj,^ 



2, 4, 5, 6. The replacement node then recovers the data stored in the failed node by multiplying by ^ 
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(19) 



as explained in the proof of Theorem |2] above. 



B. Systematic Version of the Code 



As pointed out in Section |III| any exact-regenerating code can be made systematic through a non-singular 
transformation of the message symbols. In the present case, there is a simpler approach, in which the matrix ^ can 
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Fig. 1: An example for the MBR code construction: On failure of node 1, the replacement node downloads one 
symbol each from nodes 2, 4, 5 and 6, using which node 1 is exactly regenerated. The notation < ., 1 > indicates 
an inner product of the stored symbols with the vector [1 1 1 1]*. 



be chosen in such a way that the code is automatically systematic. We simply make the choice: 



h 





A 



(20) 



where I^ is the {k x k) identity matrix, is a (A; x (d — A;) zero matrix, $ and A are matrices of sizes ((n — k)xk) 

Clearly the code is systematic. It can 



and {{n — k) x {d — k)) respectively, such that 



$ A 



is a Cauchy matrix 



be verified that the matrix ^ has the properties listed just above Theorem 2 



V. The Product-Matrix MSR Code Construction 

In this section, we identify the specific make-up of the encoding matrix ^ and the message matrix M that results 
in an [n, k, d] MSR code with /3 = 1. The construction applies to all parameters [n, k, d > 2A; — 2]r|Since the code 
is required to be an MSR code with /3 = 1, it must possess the data-reconstruction and exact-regeneration properties 
required of a regenerating code, and in addition, have parameters {a, B} that satisfy equations Q and Q. We 
begin by constructing an MSR code in the product-matrix format for d = 2k — 2 and will show in Section V-C 
how this can be very naturally extended to yield codes with d > 2k — 2. 



At the MSR point with d = 2k - 2 we have 



and hence 



Also, 



a = d — k + l = k — 1, 



d = 2a . 



B = ka = a{a + 1) . 



We define the {d x a) message matrix M as 



M 



Si 
S2 



(21) 
(22) 

(23) 
(24) 



where ^i and 52 are {a x a) symmetric matrices constructed such that the ("^ ) entries in the upper-triangular 
part of each of the two matrices are filled up by ("J ) distinct message symbols. Thus, all the B = a{a + 1) 

^In general, any matrix, all of whose submatrices are of full rank, will suffice. 

^As mentioned previously, it is impossible to construct linear MSR codes for the case of d < 2fc — 3 when /3 = 1. 
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message symbols are contained in the two matrices ^i and 5*2. The entries in the strictly lower- triangular portion 
of the two matrices Si and S2 are chosen so as to make the matrices 6*1 and 5*2 symmetric. 

Next, we define the encoding matrix ^ to be the (n x d) matrix given by 

$ = [$ A$] , (25) 

where $ is an (n x a) matrix and A is an (n x n) diagonal matrix. The elements of $ are chosen such that the 
following conditions are satisfied: 

1) any d rows of 'I' are linearly independent, 

2) any a rows of <l> are linearly independent and 

3) the n diagonal elements of A are distinct. 

The above requirements can be met, for example, by choosing ^ to be a Vandermonde matrix with elements 
chosen carefully to satisfy the third condition. Then under our code-construction framework, the i*^ row of the 
(n X a) product matrix C = ^M, contains the a code symbols stored by the i*^ node. 

The two theorems below establish that the code presented is an [n,k, d] MSR code by establishing respectively, 
the exact-regeneration and data-reconstruction properties of the code. 

Theorem 4 (MSR Exact-Regeneration): In the code presented, exact-regeneration of any failed node can be 
achieved by connecting to any d = 2k — 2 of the remaining [n — 1) nodes. 



Proof: Let 
failed node were 



/)* Ay(/)* be the row of ^ corresponding to the failed node. Thus the a symbols stored in the 



l\ Xf^\ 



M 



Si + A/^* 5*2 



(26) 



The replacement for the failed node / connects to an arbitrary set {hj \ j = 1, . . . ,d} of d helper nodes. Upon 
being contacted by the replacement node, the helper node hj computes the inner product ip*. Mcj) and passes on 
this value to the replacement node. Thus, in the present construction, the vector /i equals cj) . The replacement 
node thus obtains the d symbols ^repair^'/'r from the d helper nodes, where 



/ 



* 



repair 






L^l 



By construction, the {d x d) matrix ^repair is invertible. Thus the replacement node now has access to 

\Si<P 



M(j) 



f 



5*2' 



As 5*1 and S2 are symmetric matrices, the replacement node has thus acquired through transposition, both </>* 5i 



and (/)* 5*2. Using this it can obtain 



b'Si + Xfl\S2 , 



(27) 



which is precisely the data previously stored in the failed node. ■ 

Theorem 5 (MSR Data-Reconstruction): In the code presented, all the B message symbols can be recovered by 
connecting to any k nodes, i.e., the message symbols can be recovered through linear operations on the entries of 
any k rows of the code matrix C. 

Proof: Let 

^DC = [^DC AdC<^Dc] (28) 

be the {k x d) submatrix of ^, containing the k rows of ^ which correspond to the k nodes to which the data 
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collector connects. Hence the data collector obtains the symbols 



Si 
S2 



[$DcSi + Adc^dc52] • 



The data collector can post-multiply this term with ^^ to obtain 



[^DcSl + Adc^dc52] $dc 

Next, let the matrices P and Q be defined as 



T 



P = <^ocSi<^ 
Q = ^ncS2^l 



(29) 

(30) 

(31) 
(32) 



As Si and 5*2 are symmetric, the same is true of the matrices P and Q. In terms of P and Q, the data collector 
has access to the symbols of the matrix 

P + AucQ . (33) 

The (i, j)*, I < i,j < k, element of this matrix is 



while the {j, i)* element is given by 






(34) 



(35) 



where equation (35 1 follows from the symmetry of P and Q. By construction, all the Aj are distinct and hence 
using (34 1 and (35 1, the data collector can solve for the values of Pij, Qij for all i / j. 



Consider first the matrix P. Let $dc be given by 



$r 



6* 

-a+l 



(36) 



All the non-diagonal elements of P are known. The elements in the i*^ row (excluding the diagonal element) are 
given by 



to 

' ^ ^-1 :^j-i :Li+i :La+i 
However, the matrix to the right is non-singular by construction and hence the data collector can obtain 



^i I 1 < i < a + 1 



}■ 



Selecting the first a of these, the data collector has access to 

Si 



(37) 



(38) 



(39) 



The matrix on the left is also non-singular by construction and hence the data collector can recover ^i. Similarly, 
using the values of the non-diagonal elements of Q, the data collector can recover 82- ■ 

Remark 2: It is shown in fSl, fT?^ that interference alignment is, in fact, a necessary ingredient of any minimum 
storage regenerating code. Interference alignment is also present in the product-matrix MSR code, and Appendix [C| 
brings out this connection. 
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A. An Example for the Product-Matrix MSR code 

Let n = 6, /c = 3, d = A. Then a = d — k + l = 2 and B = ka = 6. Let us choose q = 13, so we are operating 
over F13. The matrices 5i and S2 are filled up by the 6 message symbols {ujjf^^ as shown below: 



Si 



so that the message matrix M is given by 



Ul 
U2 



U2 
U3 



, S2 



-U5 



n5 



Ul 


U2 


U2 


U3 


U4 


U5 


U5 


Uq_ 



M 



We choose ^ to be the (6 x 4) Vandermonde matrix over F13 given by 

1 



(40) 



(41) 



^ 



1 


1 


2 


4 


3 


9 


4 


3 


5 


12 


6 


10 



1 

12 



(42) 



Hence the (6 x 2) matrix <1> and the (6 x 6) diagonal matrix A are 



$ 



"1 1" 




1 2 




1 3 

1 4 


, A = 


1 5 




1 6 





12 



10 



(43) 



Fig. [2] shows at the top, the (6 x 2) code matrix C = ^M with entries expressed as functions of the message 

symbols {uj}. The rest of the figure explain how exact-regeneration of failed node 1 takes place. To regenerate 

node node 1, the helper nodes (nodes 2, 4, 5, 6 in the example), pass on their respective inner products ijj^ M[\ 1]* 

-1 *- 

for ^ = 2, 4, 5, 6. The replacement node multiplies the symbols it receives with ^j-ep^ij, where 



*n 



and decodes Siij) and S2'ip'- 



Siti 





1 2 4 


8 






1 4 3 


12 




air — 


1 5 12 


8 


•} 




1 6 10 


8_ 




Ui +U2 
U2 + Uz 


.S2±^ = 


Ua 

U5 





(44) 



(45) 



Finally, it processes Siip and S21P-. to obtain the data stored in the failed node as explained in the proof of 
Theorem |4] above. 



B. Systematic Version of the Code 



It was pointed out in Section III that every exact-regenerating code has a systematic version and further, that 
the code could be made systematic through a process of message-symbol remapping. In the following, we make 
this more explicit in the context of the product-matrix MSR code. 

Let ^fc be the {k x d) submatrix of ^, containing the k rows of ^ corresponding to the k nodes which are 
chosen to be made systematic. The set of ka symbols stored in these k nodes are given by the elements of the 
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Model 



u^+ U2+ u^U^ »- "=U5I U3+ U5+ Ug 



Node 2 U1+2U2+4U4+8U5 U2+2u3+4u5+8ug 



Node 3 u^+ 3u2+ 9u4+ Ug 



U2+ 3U3+ 9Ug+ Ug 



Replacement 
Node1 



Ui+ U2+ U4+ Ug 



U2+ U3+ U5+ Ug 



<',1> 



Node 4 Ui+4u2+ 3U4+ I2U5 U2+ 4U3+ 3U5+I2 Ug ^''1^ ^ 



Nodes Ui+ 5U2+ I2U4+ 8U5 U2+ 5U3+ I2U5+ 8ug _fll^_^ 



Node 6 Ui+ 6U2+ 1 0U4+ 8ug U2+ 6U3+ 1 0U5+ 8ug '^''P' y 



1 2 4 

1 4 3 12 

1 5 12 8 

1 6 10 8 



-,-1 



u^+ U2 
U2+U3 
U4+U5 

Us+Ug 



Fig. 2: An example for the MSR code construction: On failure of node 1, the replacement node downloads one 
symbol each from nodes 2, 4, 5 and 6, using which node 1 is exactly regenerated. The notation < ., 1 > indicates 
an inner product of the stored symbols with the vector [1 1]*. 



{k X a) matrix ^^M. Let U he a. {k x a) matrix containing the B = ka source symbols. We map 

*feM = U , (46) 

and solve for the entries of M in terms of the symbols in U. This is precisely the data-reconstruction process 
that takes place when a data collector connects to the k chosen nodes. Thus, the value of the entries in M can 
be obtained by following the procedure outlined in Theorem [5] Then, use this M to obtain the code C = ^M. 
Clearly, in this representation, the k chosen nodes store the source symbols U in uncoded form. 



C. Explicit MSR Product-Matrix Codes for d >2k — 2 

In this section, we show how an MSR code for d = 2A; — 2 can be used to obtain MSR codes for all d > 2k — 2. 
Our starting point is the following theorem. 

Theorem 6: An explicit [n' = n + 1, k' = k + 1, d' = d + l] exact-regenerating code C that achieves the cut-set 
bound at the MSR point can be used to construct an explicit [n, k, d] exact-regenerating code C that also achieves 
the cut-set bound at the MSR point. Furthermore if d' = ak' + b in code C, d = ak + b -\- {a — 1) in code C. If C 
is linear, so is C. 

Proof: If both codes operate at the MSR point, then the number of message symbols B' ,B in the two cases 
must satisfy 

B' = k'{dl-k' + l) and B = k{d - k + 1) 



respectively, so that 



B' -B = d-k + l = a 



We begin by constructing an MSR-point-optimal [n', k', d'] exact-regenerating code C in systematic form with the 
first k' rows containing the B' message symbols. Let C" be the subcode of C consisting of all code matrices in C 
whose top row is the all-zero row (i.e., the first a of the B' message symbols are all zero). Clearly, the subcode C" 
is of size q^'^" = q^ . Note that C" also possesses the same exact-regeneration and data-reconstruction properties 
as does the parent code C'. 

Let the code C now be formed from subcode C" by puncturing (i.e., deleting) the first row in each code matrix of 
C". Clearly, code C is also of size q^. We claim that C is an [n, k, d] exact-regenerating code. The data-reconstruction 
requirement requires that the B underlying message symbols be recoverable from the contents of any k rows of a 
code matrix C in C. But this follows since, by augmenting the matrices of code C by placing at the top an additional 
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all-zero row, we obtain a code matrix in C" and code C" has the property that the data can be recovered from any 
(/c + 1) rows of each code matrix in C". A similar argument shows that code C also possesses the exact-regeneration 
property. Clearly if C is linear, so is code C. Finally, we have 

d' = ak' + b 
^ d+1 = a{k + l) + b 

^ d = ak + h+ {a-l) . 

■ 

By iterating the procedure in the proof of Theorem [6] above i times we obtain: 

Corollary 7: An explicit [n' = n + i, k' = k + i, d' = d + i] exact-regenerating code C that achieves the cut-set 
bound at the MSR point can be used to construct an explicit [n, k, d] exact-regenerating code C that also achieves 
the cut-set bound at the MSR point. Furthermore if d' = ak' + 6 in code C , d = ak + b + i{a — 1) in code C. If 
C is linear, so is C. 

The corollary below follows from Corollary [7] above. 

Corollary 8: An MSR-point optimal exact-regenerating code C with parameters [n, k, d] for any 2k — 2 < d < 
n — 1 can be constructed from an MSR-point optimal exact-regenerating [n' = n + i, k' = k + i, d' = d + i] code 
C with d' = 2k' -2 andi = d-2k + 2. If C is Unear, so is C. 

VI. Analysis and Advantages of the Codes 

In this section, we detail the system-implementation advantages of the two code constructions presented in the 
paper. 

A. Reduced Overhead 

In the product-matrix based constructions provided, the data stored in the i*'* storage node in the system is 
completely determined by the single encoding vector ip. of length d. This is in contrast to a (i? x a) generator 
matrix in a general code, comprising of the a global kernels of length B, each associated to a different symbol stored 
in the node. The encoding vector suffices for the encoding, data-reconstruction, and regeneration purposes. The 
short length of the encoding vector reduces the overhead associated with the need for nodes to communicate their 
encoding vectors to the data collector during data-reconstruction, and to the replacement node during regeneration 
of a failed node. 

Also, in both MBR and MSR code constructions, during regeneration of a failed node, the information passed on 
to the replacement node by a helper node is only a function of the index of the failed node. Thus, it is independent 
of the identity of the d — 1 other nodes that are participating in the regeneration. Once again, this reduces the 
communication overhead by requiring less information to be disseminated. 

B. Applicability to Arbitrary n 

In any real-world distributed storage application such as peer-to-peer storage, cloud storage, etc, it is natural that 
the number of nodes may go up or down: in due course of time, new nodes may be added to the system, or multiple 
nodes may fail or exit the system. For example, in peer-to-peer systems, individual nodes are free to come and go 
at will. The existing, explicit constructions of exact-regenerating codes ||6|, ||7|, ||ll;j-[13] restrict the value of n to 
be d+1. On the other hand, the codes presented in this paper are applicable for all values of n, and independent 
of the values of the parameters k and d. This gives a practical appeal to the code constructions presented here. 

C. Complexity 

1) Linearity and Field Size: The codes are linear over a chosen finite field Fg, i.e., the source symbols are 
from this finite field, and any stored symbol is a linear combination of these symbols over ¥q. To arrive at the 
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product-matrix MBR code, any field of size 2n or liiglier suffices, and for tlie product-matrix MSR code, any field 
of size r? or higher suffices. By cleverly choosing the matrix ^ that meets the conditions governing the respective 
codes, it may often be possible to reduce the field size even further. 

2) Striping: The codes presented here divide the entire message into stripes of sizes corresponding to /3 = 1. 
Since each stripe is of minimal size, the complexity of encoding, data-reconstruction and regeneration operations, 
are considerably lowered, and so are the buffer sizes required at data collectors and replacement nodes. Furthermore, 
the operations that need to be performed on each stripe are identical and independent, and hence can be performed 
in parallel efficiently by a GPU/FPGA/multi-core processor. 

3) Choice of the Encoding Matrix ^.- The encoding matrix ^, for both the codes described, can be chosen as a 
Vandermonde matrix. Then each encoding vector can be described by just a scalar. Moreover with this choice, the 
encoding, data-reconstruction, and regeneration operations are, for the most part, identical to encoding or decoding 
of conventional Reed-Solomon codes. 

VII. Conclusions 

In this paper, an explicit MBR code for all values of the system parameters [n, k, d], and an explicit MSR code 
for all parameters satisfying [n, k, d > 2k — 2] are presented. Both constructions are based on a common product- 
matrix framework introduced in this paper, and possess attributes that make them attractive from an implementation 
standpoint. To the best of our knowledge, these are the first explicit constructions of exact-regenerating codes 
that allow n to take any value independent of the other parameters; this results in a host of desirable properties 
such as the ability to optimally handle multiple simultaneous, node failures as well as the ability of allowing the 
total number of storage nodes in the system to vary with time. Our results also prove that the MBR point on the 
storage-repair bandwidth tradeoff is achievable under the additional constraint of exact-regeneration for all values 
of the system parameters, and that the MSR point is achievable under exact-regeneration for all d > 2k — 2. 
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APPENDIX A 
DESCRIPTION OF A PREVIOUSLY CONSTRUCTED IMSR CODE IN THE PRODUCT-IVIATRIX FRAMEWORK 

A code structure that guarantees exact-regeneration of just the systematic nodes is provided in |[6|, for the lUSR 
point with parameters [n = d+1, k, d> 2k— 1]. Subsequently, it was shown in ITl that for this set of parameters, the 
code introduced in [6] for exact-regeneration of only the systematic nodes can also be used for exact-regeneration 
of the non-systematic (parity) nodes, provided repair construction schemes are appropriately designed. Such an 
explicit repair scheme is indeed designed and presented in ||7|. In this section, we provide a simpler description of 
this code in the product-matrix framework. 

As in ||6|, |[7|, we begin with the case d = 2k — I, since the code as well as both data-reconstruction and 
exact-regeneration algorithms can be extended to larger values of d by making use of Corollary [8] 

At the ]VISR point, with d = n — 1 = 2k — 1, we have from equations (|5]l and Q that 



a 
B 



d-k + l 
ka = k . 



(47) 
(48) 



Let S he 3. {k X k) matrix whose entries are precisely the B message symbols {ui}^^^ and let M be the {2k x k) 
message matrijy given by: 

S 



M -- 
Next, let <I> be a (A; X A;) Cauchy matrix over ¥g and p a scalar chosen such that 

p / 0, pV 1 • 

Let * be the (n x 2k) encoding matrix given by 

'I 



^ 



<I> p^ 



(49) 



(50) 



(51) 



The code constructed in |[6|, |[7| can be verified to have an alternate description as the collection of code matrices 

of the form 

S 



C =^M 



^S + pS^ 



(52) 



Note that the first k nodes store the message symbols in uncoded form and hence correspond to the systematic 
nodes. A simple description of the exact-regeneration and data-reconstruction properties of the code is presented 
below. 



''Note that the constructions presented in Sections IV and [Vj employ a (d x a) matrix M as the message matrix, whereas the dimension 
of M in the present construction is ((d + 1) x a). 
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Theorem 9 (Exact-Regeneration): In the code presented, exact-regeneration of any failed node can be achieved 
by connecting to the remaining n — 1 nodes. 

Proof: In this construction, the vector /i ^ used in the exact-regeneration of a failed node / is composed of the 
first k = a symbols of ip 



-f 



/■ 



a) Exact-regeneration of systematic nodes: Consider regeneration of the i*^ systematic node. The k symbols 
thus desired by the replacement node are 6*5*. The replacement node obtains the following n — 1 symbols from the 
remaining nodes: 



/ 



ISe,i 



(53) 



where / is a {{k — 1) x k) matrix which is the identity matrix with i*'* row removed. Since $ is full rank by 
construction, the replacement node has access to 



= [p4s + 4s' 



From (53 1 and (54i, we see that the replacement node has access to 







p^ 



(54) 



(55) 



Since p 7^ 1, the (2k x 2k) matrix on the left is non-singular. This allows the replacement node to recover the 
symbols 5*ej, which are precisely the set of symbols ejS desired. 

b) Exact-regeneration of non-systematic nodes: Let (/>* be the row of $ corresponding to the failed node. 
Then the k symbols stored in the failed node are cf)^ AS + pS^). The replacement node requests and obtains the 



following n — 1 symbols from the remaining nodes: 

/ 



-/ 




' s' 
5* 



-/ ' 



(56) 



where ^k-i is the submatrix of $ containing the k — 1 rows corresponding to the remaining non-systematic nodes. 

(57) 



This gives the replacement node access to S4> and therefore to 



(50pV^ 



'-f'% ■ 



Hence the replacement node has access to 



/ 

0* 



■ 

P^k-i 



-/• 



(58) 



The matrix on the left is easily verified to be non-singular and thus the replacement node acquires Scj) and 5*(/) 
individually from which it can derive the desired vector ((/>* 5* + pcj)^ S^). ■ 

Theorem 10 (Data-Reconstruction): In the code presented, all the B message symbols can be recovered by 
connecting to any k nodes, i.e., the message symbols can be recovered through linear operations on the entries of 
any k rows of the matrix C. 

Proof: We first introduce the following notation to denote submatrices of a matrix. If A is an (mi x 7712) 
matrix and P, Q are arbitrary subsets of {1, ... , nii] and {!,..., 711,2} respectively, we will use ^(p,q) to denote 
the submatrix of A containing only the rows and columns respectively specified by the indices in P and Q. For 
the cases when either P = {1, . . . , mi] or Q = {1, . . . , 7712}, we will simply indicate this as 'all'. 

Let P = {ni, . . . ,ni} and Q = {m-i, . . . ,m(;j,_j)} be the systematic and non-systematic nodes respectively to 
which the data collector connects. Let T = {!,..., k}\P, i.e., the systematic nodes to which the data collector 
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does not connect. Then the data collector is able to access the ka symbols 

•S'(P,all) 1 .cQx 

Thus the data collector has access to the i rows of S indexed by the entries of P and consequently, has access to 
the corresponding columns of 5* as well. 

Consider the i columns of ^(Q,aii)('S' + pS^) indexed by P. Since the entries of these columns in 5* are known, 
the data collector has access to *l'(Q,aii)5'(aii.p) • Now since the i rows of S indexed through P are also known, the 
data collector has thus access to the product 

^iQ,T)S(T,P) ■ (60) 

Now as ^((5,T) is non-singular, being a (A; — |P|, k — \P\) sub-matrix of a Cauchy matrix, the data collector can 
recover S(^j',P)- In this way, the data collector has recovered all the entries in the rows of S indexed by P as well 
as all the entries in the columns of S indexed by P. Clearly, the same statement holds when S is replaced by 5*. 
Thus the data collector has access to the product: 

$(T,T)(^ + P'9*)(T,T) • (61) 

Again $(T,r) being a sub-matrix of a Cauchy matrix is of full rank and enables the data collector to recover 
(5 + /o5'*)(Tr)- It is easy to see that from the diagonal elements of this matrix, all the diagonal elements of /Sct.t) 
can be obtained. The non-diagonal elements are however of the form Sij+pSji and Sji+pSij for I € T, j ^ T, I y^ j. 
As p^ / 1, all the non-diagonal elements of S(^t,t) can also be decoded. In this way, the data collector has recovered 
all the B entries of S. ■ 

Appendix B 
Equivalent Codes and Conversion of Non-Systematic Codes to Systematic 

In this section, we define the notion of 'equivalent codes', and show that any exact-regenerating code is equivalent 
to a systematic exact-regenerating code. 

Given any linear exact-regenerating code, one can express each of the na symbols stored in the nodes as a linear 
combination of the B message symbols {uj}^^. Let {cij\l < i < n, I < j < a} denote the j*^ symbol stored in 
the i^'^ node. Thus we have the relation: 

[Ui U2 ■■■ Ub][Gi G2 ■■■ Gn] = [Cn ■ ■ ■ Cla\c21 ■ ■ ■ C2a\ \Cnl ' ' ' Cna] , (62) 

where the {B x na) block generator matrix G = [Gi G2 • • • Gn] is composed of the n component generator 
sub-matrices 

<^i = [ l^l 9i2 ■■■ li^ ] , 

each of size {B x a), and associated to a distinct nodePJLet Wi denote the column-space of Gi. A little thought 
will show that a distributed storage code is an exact-regenerating code iff 

1) for every subset of k nodes {ij | 1 < j < k}, 

dim{Wi,+Wi, + --- + W^,) = B , 

and 

2) for every subset of {d + 1) nodes {ij \ I < j < {d + 1)}, the subspaces {Wi.}'^^^ contain a vector Wj such 
that 

We can thus define two exact-regenerating codes to be equivalent if the associated subspaces { Wj}"^^ are identical. It 
is also clear that two codes are equivalent if one can be obtained from the other through a non-singular transformation 

^In the terminology of network coding, the (B x 1) column vector g . . is termed the j"* global kernel associated to the i*'' node. 
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of the message symbols and the symbols stored within the nodes. With these two observations, it follows that two 
codes with generator matrices having the following relation are equivalent: 



G, and XG 



Yi 



Y2 



Y„ 



where the {B x B) pre-multiplication matrix X, and the {na x no) post-multiplication block diagonal matrix 
comprising of the (a x a) matrices {Yi}'^^-^, are non-singular. Clearly, equivalent codes have identical data- 
reconstruction and regeneration properties. 

Systematic Version of Exact-Regenerating Codes: It also follows that any exact-regenerating code is equivalent 
to a systematic, exact-regenerating code. To see this, suppose the set of k nodes to be systematic are the first k 
nodes. Let 

'5„ 



{g„ ,g„ , 



} Q {5,, \l<i<k, 1 < j < a} 



denote a set of B linearly independent column vectors drawn from the generator matrices of the first k nodes 
[Gi • • • Gk]- That such a subset is guaranteed to exist follows from the data-reconstruction property of a regenerating 
code. Let G be the [B x B) invertible matrix 



Then we have the relation: 



G 



Ml U2 



"ai "02 



—0.B 



ub\G 



(63) 



where {cai]f^i i^ ^^^ corresponding set of code symbols. It follows that if we wish to encode in such a way that 
the code is systematic with respect to code symbols {ca^^f^^i, the input to be "fed" to the generator matrix G is 



\Ul U2 



ub]G-^ ■ 



Appendix C 
Interference Alignment in the Product-Matrix MSR Code 



The concept of interference alignment was introduced in |21 1, p2| in the context of wireless communication. This 
concept was subsequently used to construct regenerating codes in |[6|, |[7|, pT| , 1 14 1. Furthermore, ||6|, 1 14 1 showed 
that interference alignment is in fact, a necessary ingredient of any linear MSR code. Since the product-matrix 
MSR construction provided in the present paper does not explicitly use the concept of interference alignment, a 
natural question that arises is how does interference alignment manifest itself in this code. We answer this question 
in the present section. 

Consider repair of a failed node (say, node /) in a distributed storage system employing an MSR code, and let 
nodes {1, . . . , d} be the set of d helper nodes. Recall that (from equation Q), at the MSR point we have B = ka. 
Further, since all the B message symbols should be recoverable from any subset of k nodes, it must be that any 
subset of k nodes does not store any redundant information. Let Cj, 1 < i < n, be an a-length vector denoting 
the a symbols stored in node i. Then, from the above argument, it is clear that any symbol in the system can be 
written as a linear combination of the B symbols in {ct, c^, . . . , cii._i\}. 

Let Oi, k < £ < d, denote the symbol passed by node i to assist in the repair of node /. Then we can write, 

fc-i 
Oi = c}v^j + ^dvi^i (64) 



for some vectors t)^ ^ and u^ r each of length a. The symbols in {ct, c 



/' 



-1) 



<k-l) 



i\} have no redundancy 



among themselves. Thus, the components comprising of {c^, . . . , Cfi^_i\} are undesired and hence are termed as 
interference components, and the component comprising of Cj- is termed the desired component. 
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It is shown in Q, p4| that for any MSR code, it must be that for every i G {1, . . . , k — 1}, the set of vectors 

{v£,^k<i<d} (65) 

are aligned (i.e., are scalar multiples of each other). 

The following lemma considers the repair scenario discussed above to illustrate how interference alignment arises 
in the product-matrix MSR code presented in Section [V] 

Lemma 11: For every helper node i, k < i < d, there exist scalars {a(£j)|l < i < k — 1} and an a-length 
vector b^ = [bi^i ■ ■ ■ bi^aY such that 

fe-i 
^M0^ = i/^Mb, + Y^ a,,^^M^^ . (66) 



i=l 



Proof: Re-writing the symbols passed by the helper node j (1 < j < d), 



tM4>, 



* Si + A,0*,52 



9^0l -h Aj<p_^ 

= "^^Si + Xfl^S^ . + (A,- - A/)0*^52^ . 

where equation ( |68] ) follows from the symmetry of matrices 5i and ^2. By construction, the values of the scalars 
{Aj I 1 < J < n} are distinct, which allows us to write 



(67) 

(68) 

(69) 
(70) 



#^52^. = (A, - A/)-^(V^*M0^ - ^}M^p . 



(71) 



Also, since the (k — \ =)a-length vectors {<^. | 1 < i < A; — 1} are linearly independent, iox k < I <d, there exist 
scalars {at^i | 1 < « < A; — 1} such that 



fc-i 



(72) 



From equations ( fTO] ), ( fTTj ) and (|72]), for any i G {/c, . . . , d}, we can write 



(73) 
(74) 



fc-i 
^^M<^ + (A, - A/) J] a,,i^*^52^^ 

4 = 1 
/ fe-1 \ fc-l 

V;*^M ^^ - (A, - A/) J] a,,,(Ai - A/)- V^ + J] [a^AM - A/)(A. - A/)-i) {^M<^^) (75) 

\ i=l / j=l 



where equation (74i follows from (72 1, and equation (75 1 follows from (71 1. 



